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Abstract — This paper considers the weighted sum rate (WSR) 
maximized resource allocation (RA) constrained by a system 
sum power in an orthogonal frequency division multiple access 
(OFDMA) downlink transmission system assisted by multiple 
decode-and-forward (DF) relays. In particular, multiple relays 
may cooperate with the source for every relay-aided transmission. 
A two-step algorithm is proposed to find the globally optimum 
RA. In the first step, the optimum source/relay power and assist- 
ing relays that maximize the rate is found for every combination 
of subcarrier and destination, assuming a sum power is allocated 
to the transmission at that subcarrier to that destination in the 
relay-aided transmission mode and the direct mode, respectively. 
In the second step, a convex-optimization based algorithm is 
designed to And the globally optimum assignment of destination, 
transmission mode, and sum power for each subcarrier to 
maximize the WSR. Combining the RAs found in the two steps, 
the globally optimum RA can be found. In addition, we show 
that the optimum RA in the second step can readily be derived 
when the system sum power is very high. The effectiveness of 
the proposed algorithm is illustrated by numerical experiments. 

Index Terms — Orthogonal frequency division multiple access, 
resource allocation, cooperative wireless transmission, relaying, 
decode and forward, convex optimization. 



I. Introduction 

Relay-aided cooperative wireless transmission has been 
attracting intensive research interest lately, motivated by the 
consideration that by cooperative relaying, distributed multiple 
input and multiple output (MIMO) links can be created for 
performance improvement, which is suitable especially for 
applications incapable of installing multiple antennas at the 
same radio device H]. Two protocols, namely amplify and 
forward (AF) as well as decode and forward (DF), have been 
proposed and become the focus of recent research works ||2]. 
In this paper, we consider half-duplex radio devices adopting 
the DF protocol, which carries out a relay-aided transmission 
in two time slots, referred to as the broadcasting slot and the 
relaying slot, respectively. In the broadcasting slot, the source 
emits a symbol, which is received by both the destination 
and the relays. In the relaying slot, some relays first recover 
the source symbol from received signals, then transmit that 
recovered symbol to the destination. Finally, the destination 
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combines the signals received in the two slots, then decode for 
the source symbol. In particular, it has been show in ||2] that 
selection relaying DF, which uses either the direct transmission 
mode without any relay assisting, or the relay-aided mode 
depending on channel state information (CSI), achieves full 
diversity. 

This paper addresses the resource allocation (RA) for 
an orthogonal frequency division multiple access (OFDMA) 
downlink transmission system aided by multiple relays adopt- 
ing selection relaying DF. The motivation behind considering 
OFDMA is that it is a widely recognized multiuser transmis- 
sion technique for current and future wireless systems, thanks 
to its flexibility to incorporate dynamic RA for performance 
improvement |3j. Compared to the conventional OFDMA 
transmission, the relay-aided OFDMA transmission raises 
more complicated RA problems, since it introduces extra tasks 
such as deciding the transmission mode of each subcarrier, de- 
termining assisting relays and the power allocation to them for 
each relay-aided subcarrier, besides assigning destination and 
source power at each subcarrier Therefore, novel efficient RA 
algorithms are solicited for relay-aided OFDMA transmission 
systems. 

For the point to point OFDM transmission aided by DF 
relays, some RA algorithms have been proposed lately. To 
name a few, RA algorithms have been proposed in ||4] to 
minimize the sum power under rate constraints, and in ||5], 
[6] to maximize the sum rate subject to power constraints, 
when only one relay exists and assists with selection relaying 
DF. However, at every subcarrier assigned to the direct trans- 
mission mode, the source does not transmit any symbol in the 
relaying slot, which wastes bandwidth resource. To address 
this issue, rate-optimized RA algorithms which permit source 
transmission in the relaying slot at each subcarrier assigned to 
the direct mode, have been proposed in ll7l- lfT2l . 

For the OFDMA downlink transmission aided by DF relays, 
some RA algorithms have been proposed as well. For example, 
power constrained RA algorithms proposed in llT3l - llT5l and 
lfT6l consider respectively the maximization of the sum rate 
and the weighted sum goodput, whereas the one in ifTTl aims 
at maximizing a metric depending on the rates and queue 
lengths of the source and relays. Using those algorithms, each 
destination may decode the source transmitted signals, each at 
a distinct subcarrier in the broadcasting and relaying (except 
for lfT6l ) slots, as well as the relays transmitted signals, each 
at a distinct subcarrier from a single relay in the relaying 
slot. Note that when the source transmits signals to relays in 
the broadcasting slot, every destination discards the received 
signal replicas and thus spatial diversity is not exploited. In 
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Fig. 1. Example patterns of selecting (a) one relay or (b) multiple relays 
assisting at each relay-aided subcarrier, where a black block at the i-th row 
and j-th column indicates the j'-th relay rj assists the transmission at the i-th 
subcarrier s^. 

ifTSl . a RA algorithm is proposed to maximize the sum utility 
of multiple uplink/downlink data streams aided by a single 
destination adopting selection relaying AF or DF. 

So far, the majority of the proposed RA algorithms, as 
the aforementioned ones, restrict that at most one relay 
can assist the source for every relay-aided transmission. In 
fact, when there are multiple relays available, allowing not 
just one but each of them to be eligible for assisting at 
every subcarrier, can better exploit the degrees of freedom 
in the system for performance improvement. For illustration 
purposes, example patterns of selecting single or multiple 
assisting relays are shown in Figure [T] In fact, these benefits 
have been demonstrated in ifTOl for the OFDM transmission 
to a single destination, and in |[T9l, JlOl for the OFDMA 
downlink transmission with each destination using only one 
fixed subcarrier 

Compared to the above existing works, this paper contains 
the following contributions: 

• we consider the weighted sum rate (WSR) maximized 
RA problem constrained by a system sum power in an 
OFDMA downlink transmission system aided by multiple 
DF relays. In particular, multiple relays may cooperate 
with the source for every relay-aided transmission. 

• we propose a two-step algorithm to find the globally 
optimum RA based on a divide-and-conquer strategy. In 
particular, we show that the optimum RA in the second 
step can be easily derived when the system sum power 
is very high. 

The remainder of this paper is organized as follows. In 
Section II, we describe the considered system. We propose 
the two-step RA algorithm in Section III, then derive the 
optimum RA for the second step when the system sum power 
is very high in Section IV. In Section V, the effectiveness of 
the proposed algorithm is illustrated by numerical experiments. 
Finally, some conclusions complete this paper in Section VI. 

II. System description and RA problem 

We consider an OFDMA downlink transmission system 
from a source to U destinations aided by N DF relays col- 
lected in the set vl> = {Y^\i = 1, • • • , N}. All links are assumed 
to be frequency selective, and OFDM with sufficiently long 
cyclic prefix is used to transform every link into K parallel 
channels, each at a different subcarrier facing flat fading. At 



each subcarrier, the transmission of a symbol is in either the 
direct mode, or the relay-aided mode spanning across two 
equal-duration time slots, namely the broadcasting slot and the 
relaying slot. Due to the OFDMA, each subcarrier is allocated 
to one destination exclusively. As will be described later, in 
the broadcasting slot only the source transmits symbols at all 
subcarriers. In the relaying slot, the source transmits at the 
subcarriers assigned to the direct mode, whereas some relays 
transmit simultaneously at the other subcarriers assigned to 
the relay-aided mode. 

We make the following assumptions for the considered 
system. First, we assume the radio frequency (RF) nonlinearity 
at the source and the relays is negligible, and the carrier 
frequency and symbol timing of the source are perfectly 
synchronized with those of the relays, e.g. with the tech- 
niques in II2TI . Second, every channel in the system remains 
unchanged within a sufficiently long duration, over which 
the RA algorithm can be implemented at a central controller 
knowing precisely the CSI of the system. Furthermore, the 
RA information can be reliably disseminated to the source, 
every relay, and every destination. To date, the related works, 
as introduced in Section I, are based on these assumptions 
as well. As an interesting topic, the RA for the system with 
subcarrier nonorthogonality and non-ideal CSI will be studied 
in our future work. 

Let's consider the transmission of a unit-variance symbol 
s at a subcarrier to a destination, say at subcarrier k to 
destination u. The coefficient of the channel between any two 
of the source, r^, and destination u, are notated according to 
Table H We first describe the transmission in the relay-aided 
mode. Suppose a set of relays, collected in the set TZh{u), 
are selected by the RA algorithm to assist relaying. Using 
the transmit power Ps,„(fc), the source first emits the symbol 
^y Ps,u{k)s, while each relay does not transmit anything in 
the broadcasting slot, as illustrated in Figure |2]a. At the end 
of this slot, both destination u and every relay receive the 
source signal. The signal samples at destination u and can 
be expressed by 

Vu,h{k) = \J PsM{k)\s,u{k)s + nu.h{k) (1) 

and 

2/r,,b(fc) = \l Ps,u{k)\s^^i{k)s + n^.,i,{k), (2) 

respectively, where nu.b(fc) and n^.^\j{k) represent the cor- 
ruption of the additive white Gaussian noise (AWGN) at 
destination u and r^, respectively. 

After decoding and then reencoding s, all relays in TZk{u) 
transmit simultaneously to destination u in the relaying slot, 
which in effect establishes a distributed multiple input and 
single output (MISO) transmission link as illustrated in Figure 
|2]b. Specifically, r.; G TZk{u) transmits a^-.s, where is 



TABLE I 

Channel coefficients at subcarrier fc. 



source to destination u 


source to r^ 


Ti to destination u 




As,r,(fe) 
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(a) In the broadcasting slot (b) In the relaying slot 



Fig. 2. Illustration of a relay-aided transmission at subcarrier k, where 
Tlkiu) = {r2,r3}. 

the complex weight for transmit beamforming and satisfies 
|Q;r;P = Pri,u{k), with P,-.,u{k) denoting the transmit power 
allocated to r^. To have the relays' signals add coherently when 
received at destination u, a^-- = •,yP,.._„(fc)e~-' i^Ti.^ik)) jg 
used, where arg (Ari,ti(fc)) stands for the phase of Ar..„(fc). 
Note that the above described relay-aided transmission enables 
a flexible use of all relays opportunistically through a general 
form of adaptive transmit beamforming, in that TZk{u) and 
{Pii,u{k)\Ti G TZk{u)} are determined dynamically by the RA 
algorithm based on the CSI, as will be developed later At the 
end of the relaying slot, the signal sample at destination u is 
denoted by 

y«,r(fc) = ^ ^Jl^^~Jk)\\,..^u{k)\s + nu.j{k), (3) 

where ri„ represents the corruption of AWGN at destina- 
tion u. 

Finally, destination u combines yu.h{k) and yu,r{k) based 
on the maximum ratio combining (MRC), and the output is 
denoted by 

2/u,c(fc) =y^-Ps,u(fc)(As,«(fc))*2/„,b(fc) + 

( E ^JPn.umK,,u{k)\yyuAk)■ (4) 

r,e7?,&(ii) 

We assume {n^.,\,{k)\i-i G TZk{u)}, nu.h{k), and nu.r{k) 
are independent zero-mean circular Gaussian random vari- 
ables with the same variance a^. By some mathematical 
arrangements, the signal to noise ratio (SNR) associated with 
decoding s from yi-^h{k) at e Tlk{u) in the broadcasting 
slot is 

7r,(fc) =^'s.«(fc)Gs,r.(fc), (5) 

and the SNR associated with decoding s from yu,c{k) at 
destination u is 

7u,c(fc) ^PsA^)Gs,u{k) + 

( E V^^r„n(fc)G,,,«(fc))', (6) 

Where G,^,,{k) = ^-^W^, Gs,„(fc) = and 

Gi-i.u{k) = '''^2 represent the noise power normalized 
channel gains from the source to r^, from the source to 
destination u, and from to destination u, respectively. 



To guarantee reliable decoding at destination u and every 
relay in Tlk{u), the maximum achievable rate at subcarrier k 
is 

i?M,fe,i =min{ min {lii(l + 7,.^ (fc))}, lii(l + 7„_c(fc))} 

= ln(l+7„(fc)) (7) 
in nats/two-slots, where 

7„(fc) = min{7„^c(fc),7r(fc)} (8) 

and 

7r(fc)=^s,n(fc)- min G,.,,(fc). (9) 

As for the direct transmission, we consider a protocol with 
a better use of system bandwidth compared to requiring the 
source to transmit only in the broadcasting slot considered 
in Is), 161. Specifically, the source emits two independent 
symbols with transmit powers Psi.«(fc) and Pg^.uik) in the 
two slots, respectively, and only destination u decodes the 
transmitted symbols from the two received signal samples. 
We assume the AWGN corruption associated with the two 
received samples are independent zero-mean circular Gaussian 
distributed with variance . By simple mathematical arrange- 
ments, the sum achievable rate to destination u in nats/two- 
slots can be evaluated as 

Ru,k,2 = ln(l + Fsi,u(fc)G,,„(A:)) + ln(l + Fs2,u(fc)Gs,„(fc)). 

(10) 

We consider the RA problem of finding the optimum as- 
signment of destination, transmission mode, and source power 
for each subcarrier, as well as the optimum assisting relays 
and their transmit powers for every relay-aided subcarrier, to 
maximize the WSR of the destinations when the considered 
system consumes a sum power no greater than Pf Note that 
for the system under consideration the same subcarrier is used 
by the source and the relays for transmitting a symbol in the 
relay-aided mode. This constraint could however be relaxed, 
and optimized subcarrier pairing could also be implemented. 
This would further increase the degrees of freedom for opti- 
mization. However, it would be more involved to gain insights 
on the RA algorithm design, and that is why in the current 
work, the RA algorithm is designed under this constraint of 
nonoptimized subcarrier pairing. We will nevertheless use the 
insights gained here to guide future work, which will also 
consider subcarrier pairing. 

III. The two-step RA algorithm 

We propose a two-step algorithm to solve the RA problem 
based on a divide-and-conquer strategy. We will first give 
an overview, then develop this algorithm in the following 
subsections. 

A. An overview of the two-step algorithm 

In the first step, the optimum TZk{u), Ps,u{k), and 
{PTi,u{k)\y I'i £ TZk{u)} that maximize Ru,k,i, and the 
optimum Psi.u{k) and Ps2,u{k) that maximize Ru,k,2, are 
found for every combination of subcarrier k and destination u, 
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assuming that a sum power P is used by the transmission at 
subcarrier k to destination u in the relay-aided mode and the 
direct mode, respectively. Such maximized Ru.k.i and Ru.k.2 
are denoted by Ru.k.i{P) and Ru.k.2{P), respectively. The 
associated algorithm will be developed in Section III.B. 

To simplify the RA in the second step, two user sets, namely 
Uoik) = {u\yp e [0,Pt],Ru.k.2{P) > Ru.k.liP)} and 
Unik) ^ {u\yp e [0,PtlRu,kAP) > Ru,kAP)h are 
found for every subcarrier k. Once a destination u G U-D{k) 
(resp. u G lAnik)) is allocated with subcarrier k, the direct 
mode (resp. the relay-aided mode) should always be used to 
maximize the WSR, since it results in a rate no smaller than 
the relay-aided mode (resp. the direct mode), independently 
of the sum power allocated to this subcarrier 

To formulate the maximum WSR with the Ru.k.i{P) and 
Ru,k,2{P) derived in the first step, we define for every 
destination u in Uj{{k) (resp. U-D{k)) a binary variable tu.k.i 
(resp. tu,k,2) and a nonnegative variable Pu.k.i (resp. Pu,k,2), 
where tu,k,i = 1 (resp. tu,k,2 = 1) indicates that subcarrier k 
in the relay-aided (resp. direct) mode is allocated to destination 
u, and Pu,k,i (resp. Pu,k,2) represents the corresponding 
sum power allocated to subcarrier k. For every destination 
u ^ U-D{k) U U^k), tuMS, tu,k,2, Pu,k,i, and Pu,k,2 are 
defined with the same interpretations as explained above. 
Note that tu,k,i and Pu.k.i are defined for every destination 
u ^ U^ik), and tu,k,2 and Pu,k,2 are defined for every 
destination u ^ UB.{k). Now, the maximum WSR can be 
expressed by 

K 

^ WutuMsRu,k,l{Pu,k,l) + 
k=l u^Uoik) 

Wutu,k,2Ru.k.2{Pu,k,2)) (11) 

where Wu > satisfying Y^^=i = 1 is the weight assigned 
by system designers to the rate of destination u. In particular, 
increasing Wu leads to a higher priority given to destination 
u. 

In the second step, for every subcarrier k the optimum 

{tu,k,i,Pu.k.i\yu i Uoik)} and {tu,k,2, Pu.kM'^ u ^ Unik)} 
that maximize / are found subject to the system sum power 
constraint 

K 

J2 tuMsPuMS + in,k,2Pu.K2) < Pt, (12) 

k=l u(f_Ur,(k) uiUn(k) 

as well as the constraint 



iu,k,i + 



E 



tu,k,2 < 1,V fc, 



(13) 



due to the OFDMA. It is important to note that there exist 
default constraints that V fc, u, TZk{u) should be a subset 
of VP, and every power (resp. indicator) variable should be 
non-negative (resp. binary). This problem, consisting of both 
binary and continuous optimization variables, is not convex 
since the feasible set for the binary variables is not convex. 
In Section lll.C, we will develop an algorithm to find the 
globally optimum solution. Combining the RAs found in the 



two steps, the globally optimum RA for maximizing the WSR 
can be found. 

B. RA algorithm in the first step 

First, we consider the maximization of Ru,k,2 under the 
constraint Pg^ „(/c) +Ps2,«(^) = P- According to the Jensen's 
inequality, the maximum Ru.k.2 in this case can be easily 
found as 

i?«,fc,2(P) =21n(^l + G,,„(fc)|^ , (14) 

and the optimum RA is to allocate the sum power P equally 

to Psi,„(fc) and Ps2,«(fc)- 

Next, we consider the maximization of Ru,k.i with respect 
to TZk{u), Psu{k), and {P,-. „(fc)|Vri 6 TZk{u)} constrained 
by 



.(fc) = P 



(15) 



To facilitate derivation, we define G7jj,(„) ^(fc) — 
'^T-eiZkiu) Gvi.u{k), which represents the sum of the channel 
gains from all assisting relays to destination u. 4'^ is defined 
as the set incorporating all relays sorted in the increasing order 
of Gs,vi{k), and the i-th relay in VP^ is denoted by rfc(i). In 
particular, Gs.r^(N){k) = maxi-.g^ Gs,ri (fc)- We define ^'^(i) 
as the set containing all relays m with indices from i up 
to N. When Gs,u(fc) < Gsj^{N){k), Xu.k is defined as the 
smallest i satisfying Gs.u{k) < Gs.i^(i){k). 

By means of intuitive figure illustrations, the procedure of 
maximizing Ru,k,i is derived and put in Appendix A for 
clarity. In particular, we find that 



Ru^kAP)=H^ + Gu.i{k)P), 



(16) 



where Gu,i{k) can be evaluated by one of the following 
formulas: 

1) when Gs.u{k) > Gs^r-^{N){k), G„,i(fc) = Gs,r^(Ar)(fc). 

In this case, V P G [0,Pt], Ru,k,2iP) > Ru,kAP) 
holds. 

2) when Gq,^(^^^ ^^A^^) < Gs.„(fc) < G,,,,.^(Ar)(fc), 
G„,i(fc) = Gs.uik). In this case, V P G [0,Pt], 

Ru,k,2{P) > Ru.k.iiP) holds. 

3) when Gs,„(fc) < Gs,,^(^N^{k) and Gs,„(fc) < 

G^S/k{x^,k).uik), 



GuAk) = 



G's,rfc{6)(fc)G* (6) u(fc) 
u,k<b<y,c,k G-q,i^(b),u{k) + Gs,r^{b){k) - Gs,u(fc) ' 

(17) 



where yu^k is the greatest i satisfying G^^,(i) „(fc) > 
Gs,ii(fc)- In this case, G„.i(fc) > Gs,u(fc) always holds 
as shown in Appendix A. Suppose b = Zu.k is the 
maximizer for the right-hand side of ( fTTt . the optimum 
TZkiA is 5'a:(z„,a;). The optimum Ps.u{k) is computed 
by (|46] | with b = Zu^k, and the optimum P^-^k), 
Vl-i G *fe(2:„,fe) is 

P..Ak) - (18) 



k {Zii.k) 



Ak) 
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The above analysis reveals that, the relay-aided mode with 
the optimum RA in effect transforms all channels at subcarrier 
k into a source to destination u channel with normalized gain 
Gu,i{k) and half of the system bandwidth. As a matter of 
fact, only one symbol can be sent during the two slots, while 
the system bandwidth can actually support sending two inde- 
pendent symbols per two slots. When Gu.i{k) < Gs.u{k), the 
direct mode should always be used. When G„.i(fc) > Gs^u{k), 
using the relay-aided mode leads to an increased channel gain 
but sacrificing half of the bandwidth, compared to using the 
direct mode. In this case, Ru.k.i{P) > Ru.k.2{P) holds if 

P < ""'"(g!!.;^)^""'" . otherwise RumAP) > 
which means that the relay-aided and direct modes should be 
used for the low and high power regimes, respectively. The 
interpretation is that, in the low power regime, it is more 
beneficial to increase the received power, while in the high 
power regime, it is better to increase the number of channel 
uses per time unit ||22| . Based on the above analysis, U£,{k) 
and Uji{k) for every subcarrier k can be expressed as 

Uuik) = {u\Gu,i{k) < GsAk)} (19) 

and 

Unik) = {u\G^Ak) > G^AklP. < ^^^'^^^^(^f^} ■ 

(20) 

C. RA algorithm in the second step 

We propose an algorithm to solve the RA problem in 
the second step based on the following strategy. First, for 
every subcarrier fc, tu,k.i and tu^k,2 are relaxed to be real 
variables between and 1. Now, tu,k.i (resp. tu.k.2) can be 
interpreted as the fraction of the whole duration allocated 
to transmitting at subcarrier k to destination u in the relay- 
aided (resp. direct) mode, and r]u,k,i = tu,k,iRu,kAPu,ks) 
(resp. riu,k,2 = i«,fc,2-R«,fe,2(-P«,fe,2)) can be interpreted as 
the rate averaged over the whole duration for transmitting to 
destination u at subcarrier k in the relay-aided (resp. direct) 
mode. Then, the relaxed RA problem can be transformed with 
change of variables into a convex optimization problem, and 
its globally optimum RA can be found as will be shown later 
Most interestingly, we will show that every optimum tu,k,i 
and tu.k.2 for the relaxed problem are still equal to either or 
1, which means that the globally optimum RA to the relaxed 
problem is also globally optimum to the original RA problem. 

To make the change of variables, Pu.k.i is substituted with 
^"'^■^ for every u ^ U^ik), and Pu.k.2 with ^"'^'^ for every 
u ^ U^{k). Now, the sum power constraint, r/„ ^ 1, and 7711.^.2 
are respectively expressed as 

K 

fc=l u(Uo{k) ufURik) 

Vu,k,i = tu,k,i In ( 1 + G'„,i(fc) ^"''''^ ) , (22) 

n, 1 (i I Gs,u{k) Pu,k,2\ 

?7^i,/c,2 = 2 t„,fc,2 In IH . (23) 

V ^ lu,fe,2 / 



Note that 77^ ^ 1 (resp. 77^ ^ 2) is a concave function of 
tu,k,i > and pu,k,i (resp. tu^k,2 > and Pu,k,2), since 
it is a perspective of the concave function Ru,k.i{Pu,k,i) 
(resp. Ru,ka{Pu,ka)) (refer to page 89 in 123] for more 
details). One delicate issue is that after the change of variables, 
Vu.k.i (resp. 7]u.k.2) can not be evaluated at tu,k,i = (resp. 
tu,k,2 — 0). To address this issue, we expand the domain 
of riu,k,i to incorporate tu,k.i = 0, and define riu,k,i = 
when tu.k.i = 0. After the expansion and definition, riu,k,i is 
still a concave and continuous function of tu.k.i > 0, because 
lim^^^ ^ i-!-o 'yii.fc,! = 0- Motivated by the same consideration, 
we expand the domain of riu.k.2 to incorporate tu.k.2 = 0, and 
define 'riu.k.2 = when tu^k,2 = 0, so as to ensure r]u,k,2 is a 
concave and continuous function of tu,k,2 > 0. 

Based on the above analysis, the relaxed RA problem with 
new variables, i.e., maximizing / subject to the constraints 
(flJl l and (l2Tl i. is a convex optimization problem. Obviously, 
this problem satisfies the Slater constraints qualification, i.e. 
there exists at least one feasible solution satisfying all inequal- 
ity constraints strictly. This justifies the zero duality gap for 
the problem li24J . Therefore, the dual method can be used 
to solve this problem for the globally optimum RA ll25l . 
Specifically, dual variables are defined for certain constraints, 
then an iterative algorithm consisting of an inner loop and an 
outer loop is implemented. In the inner loop, the Lagrangian 
maximization problem (LMP), which is to find the optimum 
RA variables that maximize the Lagrangian, is solved with 
the dual variables fixed. In the outer loop, the optimum dual 
variables are found by iteratively updating the dual variables 
with the subgradient method, until the Karush-Kuhn-Tucker 
(KKT) conditions are satisfied. The optimum RA variables for 
the LMP given the optimum dual variables, are the globally 
optimum solution for the relaxed RA problem. 

To use the dual method, a nonnegative dual variable p is 
introduced for the system sum power constraint (l2Ti . and the 
associated LMP is 

maxL(^) ^f + piPt-P,) 
s.t.O, (24) 

tuMS >0, Pu.k.i > 0,Vfc,VM^ WD(fc), 

tu.k.2 >o, Pu.k.2 >o,yk,yu(^UR{k), 

where L{p) represents the Lagrangian given p, and P^ is the 
left-hand side of ( 1211 1. The optimum dual variable denoted by 
p*, must satisfy the KKT conditions consisting of (a) p* > 0, 
(b) /i*(Pt - P^{p*)) = 0, and (c) P,(/i*) < Pt, where P,{p) 
represents the P^ evaluated with the optimum power variables 
to (l24l i given p 1231 . 1241 . It is important to note that p* 
must be strictly positive, which can be justified as follows. 
Suppose p* = 0. In this case, the maximum L{p*) subject 
to the constraints of ( l24l i is infinity, since /, as a function of 
"Hu.k.i and riu.k,2, is monotonically increasing with pu.k.i and 
Pu.k.2, which can be increased unboundedly while remaining 
feasible to (l24l i. This means that Px{p*) is infinity, which 
contradicts (c) of the KKT conditions. Therefore, p* ^ 0, and 
the KKT conditions are reduced to /i > and Pt — Px{p*)- 
Let's now find the optimum RA variables to (l24l i given p > 
0. To this end, we find for every subcarrier k,\/ u ^ Uu{k), the 
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optimum pu.k.i given tu.k.i, and Vu ^ U^{k), the optimum 
Pu,k,2 given tjj.fc.2, to maximize After mathematical 

arrangements, we can show these optimum values are 



Algorithm 1 Procedure to solve 




Pu,k,i 

I 0+ _ Wu 

G=,„(fc) 

where — max{x,0}. Using 



Hi = 1, 
if j = 2, 

is reduced to 



(25) 



K 

E 

k=l 



Lkip) + pPt- 



where 

Lk{p) = 



E 

u0A-D(k) 



tu.k.lXkiu) + ^ tu,k,2Ykiu), 
ufUiiik) 



(26) 



(27) 



and 




(28) 



(29) 



Now, the problem of finding the optimum {i„,fe,i|VM ^ 
Z^D(fc),Vfc} and {^„,fc,2|Vu ^ Z^R(fc),Vfe} for (|2lli' can be 
decomposed into K subproblems, the fc-th of which is to find 
the optimum {t„,fc,i|V'u ^ Z^D(fc)} and {t„,fe,2|Vu ^ Z^R(fc)} 
for maximizing Lk{p) subject to the constraints in (l24l i. It can 
be readily shown that the optimum {tu.k.i\^u ^ Ui){k)} U 
{tu,k,2\yu ^ Z//R(fc)} has all entries equal to 0, except for one 
entry tu^,k,ik equal to 1. Obviously, {uk,ik) is the {u,i) with 
the maximum metric mu,i{k), expressed as 



Xk{u) 
Yk{u) 



if i 
if i 



(30) 



Note that when computing mu.i{k), i can only be 1 (resp. 2) 
if w G UYi{k) (resp. u G U-D{k)), while i may be either 1 or 2 if 
u ^ UYi{k) UlA-Dlk). There might exist multiple combinations 
of (w, i) corresponding to the maximum mu^i{k). In this 
case, any combination of them can be chosen to be {uk,ik)- 
According to (IZSl l. the optimum power variables related to 
subcarrier k are all equal to zero, except for Pu^,,k,ik computed 
with 



i(fc) 
1 



if Zfe = 1, 
if jfe = 2. 



(31) 



The procedure of finding the optimum RA variables to (l24l 
is summarized in Algorithm [T] Next, we address the problem 
of finding p* in the outer loop iteratively. To speed up the 



for = 1 to do 

Vit and the associated i, compute mu^i{k) defined in l |30t 
{uk,ik} = argmaxu,; mu,i(fe); 
tuk,k,tk = 1 and compute Puk,k,ik by lUD; 
Pu,k,i = and tu,k,i = if either u ^ Uk or i / ifc. 
end for 



Algorithm 2 The RA algorithm in the second step 

Find /iu satisfying Pv{p\j) = Pu 
Find satisfying PL(AiL) = Pu 

For each in S = {^l + ]^(mu - ML),n = 0, 1, • ■ • , A^s — 1}, 
use Algorithm [T] to find the optimum RA variables to l l24t . and 

compute Px{p); 

Initialize /i with the /i G S satisfying (/i) < Pt and having the 

minimum Pt — Px{i-i); 

repeat 

Update p with [p - S{Pt - P^p))]-^; 
Use Algorithm [T] to find the optimum solution for i24\ : 
Compute Px{p); 
until > and < Pt - Pxip.) < e 

The optimum solution for ( 1241 found in the last iteration corre- 
sponds to the optimum sum power and mode for each subcarrier 
in the second step. 



convergence rate, /i is first initialized by a value close to p*. 
To this end, we define 



fc=l 

K r 



2 max,, (it;,, I , ,1"'' 
^I-L-^ - ML(fc) 



fc=i 



where 



-^^u(fc) = max < max 



1 



max 



U^WD(fe) G„,l(fc)'u^WR(fc) G's,«(fc) 



il/L(fc)=min< min 



uiUuik) Gu.lik) ' u^Unik) Gs,„(fc) 



(32) 
(33) 

(34) 
(35) 



It can be readily shown that P'l{ij*)<Px{p*)^P\]{p*) 
always holds. Suppose /iu and /iL satisfy Pu(mu) = Pt 
and Phiph) ~ Pt, respectively. Since Pu(/^u) = -Pt = 
Px{p*)<Ptj{p*) and Ptj{p) is a non-increasing function of 
/i, /iu > P*- Similarly, we can show /i* > /il, thus /i* is 
confined in the interval [/iL,/iu]- To initialize /i with a value 
close to /i*, this interval is sampled uniformly at Ng points to 
build the set S ^ {fii^ + f-ipu ~ fii^),n ^ 0,1, N, - 1}. 
Given each p in S, ( l24b is solved and Px{p) computed, then 
/i is initialized with the /i G 5* satisfying Px{lP) < Pt and 
having the minimum Pt — Px{lP}- 

After initialization, /i is iteratively updated based on the sub- 
gradient method, i.e., /i is updated with [p — S{Pt — Pxip))]^ 
where S is a sufficiently small value to guarantee convergence, 
until /(* is found when Pt = Px{p-) and /j > 0. The RA 
algorithm in the second step is summarized in Algorithm |2] 
Due to numerical issues, we regard the KKT conditions to be 
satisfied when /i > and < Pt — Pxip) < where e is a 
very small positive value. 
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IV. The optimum RA in the second step for a 

SPECIAL CASE 

We will show the optimum RA in the second step can be 
easily derived when Pt is very high so that the associated 
satisfies 



source(square), relays(circle), destinations(diamond) 



1 



1 



V fc, »max{max{— — jTT.-p, — 77T//' (36) 



Vfc, 



min,, w,, 



>max{max{G„,i(fc),Gs,„(fc)}}. (37) 



Since (|36l l holds and fi* < fJ-\j, the optimum sum power P^. 
for subcarrier k satisfies 



Vfc, Pk 



2iOu 



if ik = 1 



(38) 



according to dsTT l. where {uk, ik) corresponds to the optimum 
destination and mode for subcarrier fc found by the RA 
algorithm in the second step. From (|36] |, (l37t . and (l38T l, the 
conditions 

Vfc, u, PfeG,,„(fc) » l,Pfe » G,,„(fc), (39) 
Vfc,u,PfcG„,i(fc-) » l,Pk » Gusik), (40) 

hold. This means that in the high power regime, even though 
the optimum sum power for each subcarrier is not precisely 
known, it is for sure that V fc, w, the following approximations 
hold: 



Mu,k,2{Pk) ~2u;„(ln(Pfe) +ln 



Gs,u{k) N 



2wuHPk), 
(41) 

WuRu,k,i{Pk) «w„(ln(Pfe) + lnG„,i(fc)) ww„ln(Pfc). 

(42) 

By comparing W\\ and (|42] |. it can easily be seen that the 
direct mode for each subcarrier fc is the optimum for maxi- 
mizing the WSR, independently of the destination assigned. 
All this confirms the former interpretation that, for the high 
power regime, increasing the number of channel uses per time 
unit is more beneficial than increasing the received power 
If equal weights are used for all destinations, the optimum 
destination for subcarrier fc is the one with the maximum 
Gs,ti(fc)- Otherwise, the optimum destination for all subcarriers 
is the same and given by the one with the maximum weight 
Wu- In either case, the optimum sum power to subcarrier fc is 
Pfc w in order to satisfy the KKT conditions according to 
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Fig. 3. The OFDMA system considered in numerical experiments. 



as — o-Qe^^* [i — 0,1,-- - ,5). Moreover, the average 
received power attenuation is equal to ad~^ at a distance of d. 
Note that IQXogiQ^a), representing the log-normal shadowing 
effect, is Gaussian distributed with zero mean and variance 
equal to dB. This means that the transmit power is attenuated 
by 30 dB in average at a distance of 10 m. 

To illustrate the effectiveness of the optimum RA found 
for the proposed protocol, we compare it to the optimum RA 
for a reference protocol. Both protocols are the same except 
that with the reference protocol, at every subcarrier in the 
direct mode the source only transmits in the broadcasting 
slot (and remains idle during the relaying slot). Interestingly, 
when the destinations use equal weights, the optimum RA 
for the reference protocol can easily be derived on a per 
subcarrier basis. For subcarrier fc, the optimum destination 
Uk is the one with the maximum Gu{k) where G„(fc) — 
max{Gti.i(fc), Gs,ti(fc)}, i-e., the relay-aided mode is optimum 
only if G„j..i(fc) > Gg.Ukik)- The optimum per subcarrier 
sum power can be calculated according to water filling of the 
system sum power Pt to K parallel channels with normalized 
gain Guf, (fc) for the fc-th channel. It is important to note that 
for the reference protocol, the above RA might not be optimum 
when destinations use unequal weights. In this case, the RA 
needs to be optimized jointly over all subcarriers with the 
same two-step optimization strategy as proposed in Section 
III, which is not developed here due to space limitation. On 
the contrary, the RA for the proposed protocol always needs 
to be jointly optimized over all subcarriers. 



V. Numerical experiments 

In numerical experiments, we consider a multiple DF relays 
aided OFDMA downlink transmission system with = 4 
relays. The source is allocated at the origin, and the relays 
ri, r2, ra, and 14 are located at coordinates (—15,-5), 
(—5, —5), (5, —5), and (15, —5), respectively. Note that all 
the aforementioned coordinate-related values have the unit 
of meter The random channel between every two of the 
source, the relays, and the destinations is generated based on 
a 6-tap delay line model, with the i-th tap's amplitude as a 
circular Gaussian random variable of zero mean and variance 



A. Results for a random realization of channels 

For clarity of presentation, we first assume if = 32 
subcarriers are used, <t^ = —30 dBW, [/ = 4 destinations 
exist with equal weights, A^^ = 100, 5 = lO^^l^u — /xl), and 
e = 0.1. For the randomly generated destination coordinates 
shown in Figure [3] we have produced a random realization 
of all channels (not shown here due to space limitation), 
over which the optimum RAs for the proposed and reference 
protocols are evaluated. For every destination u, the Gu.i{k) 
computed by the proposed RA algorithm in the first step, and 
Gs,u(fc) are shown in Figure |4] for every subcarrier fc and 
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Fig. 4. (a) Gs.u (fc) and (b) j (fe) computed by the proposed RA algorithm 
in the first step, at each subcanier k for evei7 destination. 
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(b) 

Fig. 5. When Pt = 35 dBW, (a) the visualization of W]3(A;), V k, where a dot 
marked at the coordinate (k,u) indicates destination u £ UY){k), otherwise 
w ^ U-D{k). (b) the visualization of U-[{ {k), V k, where a dot marked at the 
coordinate (k,u) indicates destination u £ U^{k), otherwise u ^ U^{k). 



destination. 

First, we set Pt = 35 dBW which corresponds to the low 
power regime, since if Pt is uniformly allocated to the subcar- 
riers of the source, the average SNR is 2.17, 5.80, —2.38, and 
5.23 dB at four destinations, respectively. The corresponding 
i^D(fc) and UYi{k) are found and visualized in Figure |5] for 
every subcarrier The optimum RA and the corresponding 
WSR have been evaluated for the proposed protocol with the 
RA algorithm developed in Section III, and for the reference 
protocol with the aforementioned method. The results are 
shown in Figure |6] We can see that for the reference protocol, 
the optimum destination and mode at every subcarrier k are 
Wfe = 4 with the maximum Gu{k) = ma.x{Gu,i{k),Gs.u{k)} 
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Fig. 6. For the reference and proposed protocols, the optimum RA found in 
the second step for each subcarrier k when Pt = 35 dBW. 



and the relay-aided mode, respectively. The uniform sum 
power allocation to all subcarriers is optimum. The reason is 
that G„,i(fc) for destination 4 varies slightly over all subcarri- 
ers. It is very interesting to note that, compared to the reference 
protocol, assigning a few subcarriers to destination 2 or A in 
the direct mode with a higher per subcarrier sum power, leads 
to a higher optimum WSR for the proposed protocol, even 
though at those subcarriers Gg for destination 2 or A is 
smaller than Gu.i{k) for destination 4. For both protocols, 
the optimum Pst.u{k) and {Pi-. ,j(fc)|V r^} computed by the 
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The optimum source/relay power at each subcarrier k for the reference protocol 
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Fig. 7. The optimum source and relay power in the broadcasting slot for the 
two protocols when Pt = 35 dBW. 

Visualization of Up(l^) 
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Fig. 8. The visualization of lA-R ik) for each subcarrier k when Pt = 60 
dBW, where a dot marked at the coordinate (k,u) means that destination 
u € UYi{k), otherwise u ^ Wa(fc). 



RA algorithm in the first step are shown in Figure |7] It is 
shown that 7-3 and r4 assist relaying simultaneously at a few 
subcarriers for both protocols. 

Next, we set Pt = 60 dBW, which satisfies the conditions 
( |36] l and ( [37] i. For each subcarrier fc, the U^ik) is the same as 
when Pt = 35 dBW, since it is determined independently of 
Pt according to the conditions in iT% . The Uii{k) is visualized 
in Figure |8] for every subcarrier Now, lAYi{k) contains none 
of the destinations at most of the subcarriers, since Pt is 
so high that the conditions in (|20] i are not satisfied by any 
destination at many subcarriers. The optimum RAs and WSRs 
have been evaluated for the two protocols and shown in 
Figure |9] It is shown that the optimum RA for the proposed 
protocol leads to a much higher WSR compared to that for the 
reference protocol. For the reference protocol, the optimum 
mode and destination at each subcarrier are the same as when 
Pt = 35 dBW. This is because when the destinations have 
equal weights, the optimum mode and destination for the 
reference protocol only depends on the Gs,u{k) and Gu.i{k) 
of the destinations. The uniform sum power allocation to all 
subcarriers is still optimum, as explained for the case when 
Pt = 35 dBW. For the proposed protocol, the optimum 
destination and mode for every subcarrier k are the destination 
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Fig. 9. For the reference and proposed protocols, the optimum RA found in 
the second step for each subcarrier k when Pt = 60 dBW and destinations 
have equal weights. 



with the maximum Gs,u{k) and the direct mode, respectively, 
and the uniform sum power allocation is also optimum. For the 
reference protocol, the optimum Psi.u{k) and {Pi..,„(/s)|V r.;} 
computed by the RA algorithm in the first step are shown 
in Figure [TO] It is shown that and r4 assist relaying 
simultaneously at a few subcarriers. Note that for the proposed 
protocol, the optimum sum power at subcarrier k is equally 
allocated to Pg^ ^(fc) and Pg^ ^(fc), which is not shown here. 
For the considered channel realization, the above observa- 
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tions illustrate the effectiveness of the proposed protocol and 
algorithm to adapt the RA dynamically for WSR maximiza- 
tion. They also support the analysis of Section IV for the 
high power regime. When Pt = 60 dBW and the weights for 
destination 1, 2, 3, and 4 are 0.4, 0.2, 0.2, 0.2, respectively, the 
optimum RA for the proposed protocol has also been evaluated 
and shown in Figure [TT] We can see that destination 1 with 
the maximum weight and the direct mode is optimum for each 
subcarrier. The uniform sum power allocation is optimum. 
These observations illustrate the effectiveness of the analysis 
of Section IV for unequal destination weights in the high 
power regime. 

B. Results for a set of random realizations of channels 

For the comprehensive illustration of scenarios close to 
reality, we have generated 1000 random channel realizations. 
For each realization, the optimum RAs and the corresponding 
WSRs for the reference and proposed protocols are evaluated. 
We assume K — 64 subcarriers are used, = —30 dBW, 
U = 8 destinations exist with equal weights, Ng = 100, 
S = 10~'^(^u — /il), and e = 0.1. To produce each channel 
realization, the destination coordinates are first generated in the 
region {{x,y)\ — 10 < x < 10, —30 < y < —10} uniformly 
and independently of each other, then the random channels are 
produced. 

For better clarity, the optimum WSRs for only the first 100 
channel realizations, and the average WSR for all channel 
realizations are shown in Figure [12] for both protocols when 
Pt is 35 and 60 dBW, respectively. It is found that for 
every channel realization, the optimum WSR of the proposed 
protocol is equal to or greater than that for the reference 
protocol. The reason is that the set of feasible RAs searched 
for the optimum RA of the reference protocol, is a subset 
of that of the proposed protocol, since extra constraints are 
imposed on the RA optimization for the reference protocol, i.e. 
Ps2.u{k) must be zero for every subcarrier in the direct mode. 
Compared to the reference protocol, the optimum RA for the 
proposed protocol leads to either the same or a slightly greater 
optimum WSR for each channel realization when Pt = 35 
dBW. However, a much higher optimum WSR results from 
the optimum RA and the proposed protocol for the majority 
of channel realizations when Pt = 60 dBW. As a result. 
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Fig. 10. The optimum source and relay power in the broadcasting slot for the 
reference protocol when Pt = 60 dBW. Note that for the proposed protocol, 
the optimum sum power at each subcarrier k is equally allocated to Psi,u(fc) 
and Ps2,u(fc), which is not shown here. 
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Fig. 11. For the proposed protocol, the optimum RA found in the second 
step for each subcarrier k when Pt = 60 dBW and destination 4 has the 
maximum weight. 



the average optimum WSRs for the two protocols are very 
close when Pt = 35 dBW, while the proposed protocol and 
RA result in a much higher average optimum WSR than the 
reference protocol when Pt — 60 dBW. As already said, this 
is due to the better exploitation of channel uses by the direct 
mode of the proposed protocol in the high power regime. 

For both protocols, the rate cumulative distribution functions 
(CDF) and average rates of user 1 are shown in Figure [T3] 
when Pt is 35 and 60 dBW, respectively. Note that for the 
same protocol and Pt, all other users have similar rate CDFs 
and average rates as user 1, since all users' coordinates are 
independently and identically distributed in the same region. 
It can be seen that both protocols lead to almost the same rate 
CDFs for user 1 when Pt = 35 dBW. Nevertheless, using the 
proposed protocol makes it more likely to transmit with high 
rate to user 1 when Pt = 60 dBW. This means that for a fixed 
outage probability, the proposed protocol leads to a higher 
outage rate for user 1 than the reference protocol. As a result. 
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Fig. 12. The optimum WSRs for the first 100 channel realizations for 
the reference and proposed protocols, and the average WSR for all channel 
reahzations. 



the average rates of user 1 for both protocols are almost the 
same when Pt = 35 dBW, while the proposed protocol leads 
to a much higher average rate of user 1 than the reference 
protocol when Pt = 60 dBW. These observations illustrate 
the effectiveness of the proposed protocol and algorithm for 
the random channel realizations. 

VI. Conclusion 

We have considered the WSR maximized RA problem 
constrained by a system sum power in an OFDMA down- 
link transmission system assisted by multiple DF relays. In 
particular, multiple relays may cooperate with the source for 
every relay-aided transmission. We have proposed a two-step 
algorithm to optimize the RA. Furthermore, we have shown 
that the optimum RA in the second step can be easily derived 
when the system sum power is very high. The effectiveness 
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Fig. 13. User I's rate CDFs and average rates for the reference and proposed 
protocols. 



of the proposed algorithm has been illustrated by numerical 
experiments. 

Appendix A 

Maximization of Ru.k,\ in the relay-aided mode 

When Gs^u{k) > Gs^r^{N){k), Jr{k) < 7„,c(fc) always 
holds, thus Ru^kj = ln(l + Ps,«(fc) ■ mini.^gK^(„) Gs^iik)), 
which means that Ru.k.i is equal to the maximum rate allowed 
for reliable transmission from the source to all relays in 
TZk{u). In this case, the maximum Ru.k.i is Ru.k.i{P) = 
\n{l+Gs.rkiN)ik)P)- To achieve it, Tlk{u) and Ps.u{k) should 
be equal to {rfc(A^)} and P, respectively. 

Now we consider the case when Gs_^^t^isi){k) > Gs.u{k). 
The maximization of Ru.k.i is equivalent to the maximization 
of 7„(fc), as the minimum of ^u,c{k) and 7i(fc). Note that 
7i.(fc) depends on Ps.u{k) and TZk{u), while besides them 
^u.c{k) depends on {Pri,u(fc)|ri G TZk{u)} as well. To 
maximize 7„(fc) given P, ^u,c{k) can be first maximized with 
Ps,u{k) and TZkiu) fixed, by finding the optimum allocation of 
the sum relay power P — Ps.u{k) to {Pii.u{k)\ri 6 TZk{u)}. 
Denoting such maximized ■yu,c{k) by Jucik), the optimum 
{Ps.u{k),Tlk{u)} that maximizes uiin{j'^ c(^)j7r(^)} i^ '^^en 
found. Finally, that {Ps.u{k), TZk{u)}, as well as the associated 
optimum allocation of P — Ps.„(fc) to {Pr^.u{k)\Ti G TZk{u)}, 
constitute the optimum solution to maximizing 7„(fc) given 
P. 

To facilitate derivation, we assume the relays in Tlk{u) are 
sorted in the increasing order of Gg.ii (k), and the first relay in 
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TZk{u) is rfe(6), namely the 6-th relay in ^E'^. This means that 
7^fc(li) is a subset of *fc(6), and G7^^(„).„(fc) < G*^(6),„(fc). 
It is important to note that 6*5 ^^(2) {k) is an increasing function 
of i, while G^^(i).„(fc) is a decreasing function of i. 

Now, 7r(fc) = Ps,u(fc)G5 and according to the 

Schwartz Inequality, 

7u.c(fc) < PB,..(fc)Gs,„(fc) + {P- Ps,^,(fc))GTC,(„),„(fc), (43) 

where the equahty holds when 

(P-P,,„(A:))Gr.,„(/c) 



(44) 



SrieKfc(M) G'ri,ti(^) 

Obviously, the right hand side of ( |43] l is equal to 7^ c(^)- 
As Ps^u{k) increases from to P, {Ps^u{k),^r{k)) moves 
along the line from the origin to A, whereas {Ps,u{k), 7^ c(^)) 
along the line from B to C in the coordinate space shown in 
Figure [141 where the coordinates of A, B, and C are given in 
Table Note that C is fixed, while both A and B depend on 
TZk{u). In particular, with TZk{u) and P fixed, the maximum 
of min{7^ j,(fc), 7i.(fc)} with respect to Ps.u{k), denoted by 
'y'uik), is achieved at A, C, and D for a given TZkiu) satisfying 
respectively 

1) 6 < Xu^k, where is the smallest i satisfying 
Gs,,^{:i){k) > Gs,u{k). 

2) b> Xu,k and G-ji^ (k) < G,,„(fc), 

3) 6 > Xu,k and G-ji^ (k) > G,,„(A:). 

We can see that only in case 3, j'^ik) is greater than 
Gs,u{k)P- Among all 7?,fc(it)'s having the same first relay 
as rfc(fe) and corresponding to case 3, the TZk{u) with the 
maximum G7jj.(,u) „(fc) corresponds to the greatest height 
of D, which suggests that '^k{b) is the one achieving the 
maximum 74(fc) in that set. The corresponding 7^(fc) and 
Ps,u{k) are computed respectively as 



<{k) 



Gs,r^{b){k)Gq,^(^b).u{k)P 

G*^(b),„(fc) + Gs,r^(b)ik) - Gs,u{k) 



(45) 



and 



PsAk) 



G^^{b),u{k) 



G-iU 



-P. (46) 



.(6),«(fc) + Gs,i.^(b)(fc) - GsAky 

Based on the above analysis, we can decide what the 
optimum TZk{u), Ps,u{k), and P^.^k) should be. When 
G^k{x^ k),u{k) < Gs,u(fc), every possible TZk{u) corresponds 
to either case 1 or 2, which means that the 7(j(fc) maximized 
over TZk{u) and Ps,u{k) is equal to Gs,u{k)P. In this case, 
the maximum Ru.k.i is Ru,k,i{P) = ln(l + Gs,u{k)P), and 
to achieve it Ps.u{k) should be equal to P. 

When G*^(^^ j^)_„(fc) > Gs.u{k), every possible TZk{u) 
corresponds to one of cases 1, 2, and 3. Obviously, the 
optimum TZk{u) should correspond to case 3. To have TZk{u) 
corresponding to case 3, b must be between x„ ^ and yu.k, 
where j/„_fc is the greatest i satisfying G\ij^(^iyu{k) > Gs,u{k)- 
As pointed out before, ^"^-(6) is the TZk{u) achieving the 
maximum 74(fc) among all TZkiuYs having the same first relay 
as r/j(6). Therefore, the optimum TZk{u) should be 'ifk{zu,k), 
where Zu,k is the b between Xu,k and yu.k and maximizing 
7^(fc), namely the maximizer of the right-hand side of (fTTI i. 
In this case, the maximum Ru.k.i and the optimum RA can 




(c) case 3 

Fig. 14. Illustrations of 7^ ^i^)' liik), and 7u(fc) versus Pa,ii(fc). 7u(fc) 
is highlighted by painting the area it bounds in light grey. 

TABLE II 
Coordinates of A, B, and C. 





A 


B 


C 




P 





P 


SNR 




Gnk(u),uik)P 


Gs,u{k)P 



be evaluated with the formulas for the third case shown in 
Section III.B. 
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